Stochastic programming - PDFSEARCH.IO - Document Search Engine

Stochastic programming
Results: 538

#	Item
221	Better Be Lucky Than Good: Exceeding Expectations in MDP Evaluation Thomas Keller and Florian Geißer University of Freiburg Freiburg, Germany {tkeller,geisserf}@informatik.uni-freiburg.de Source URL: gki.informatik.uni-freiburg.de Language: English - Date: 2015-03-07 08:48:57 Dynamic programming Stochastic control Markov decision process Reinforcement learning Markov chain Valuation Secretary problem Statistics Markov processes Markov models
222	Past, Present, and Future: An Optimal Online Algorithm for Single-Player GDL-II Games Florian Geißer and Thomas Keller and Robert Mattmüller Source URL: gki.informatik.uni-freiburg.de Language: English - Date: 2014-08-08 08:44:39 Stochastic control Control theory Search algorithms Partially observable Markov decision process Markov decision process Tree traversal Game theory Statistics Dynamic programming Markov processes
223	Sample Complexity and Performance Bounds for Non-parametric Approximate Linear Programming Jason Pazis and Ronald Parr Department of Computer Science, Duke University Durham, NC 27708 {jpazis,parr}@cs.duke.edu Source URL: www.cs.duke.edu Language: English - Date: 2013-04-10 17:56:12 Systems theory Dynamic programming Equations Operations research Stochastic control Lipschitz continuity Optimal control S0 Markov decision process Statistics Mathematical optimization Control theory
224	Point-Based Policy Iteration Shihao Ji, Ronald Parr†, Hui Li, Xuejun Liao, and Lawrence Carin Department of Electrical and Computer Engineering † Department of Computer Science Duke University Source URL: www.cs.duke.edu Language: English - Date: 2007-04-27 18:47:12 Markov processes Stochastic control Control theory Partially observable Markov decision process Valuation Markov decision process Algorithm Function Statistics Mathematics Dynamic programming
225	Journal of Machine Learning Research [removed] 1149 Submitted 8/02; Published [removed] Least-Squares Policy Iteration Michail G. Lagoudakis Source URL: www.cs.duke.edu Language: English - Date: 2003-12-20 17:38:33 Dynamic programming Stochastic control Mathematical optimization Markov processes Numerical analysis Reinforcement learning Markov decision process Approximation Least squares Statistics Mathematical analysis Mathematics
226	Reinforcement Learning with Hierarchies of Machines Ronald Parr and Stuart Russell Computer Science Division, UC Berkeley, CA [removed] parr,russell @cs.berkeley.edu Source URL: www.cs.duke.edu Language: English - Date: 2012-05-29 14:54:19 Dynamic programming Stochastic control Reinforcement learning Markov decision process Q-learning Machine learning Finite-state machine Markov chain Algorithm Statistics Markov processes Markov models
227	Reinforcement Learning as Classification: Leveraging Modern Classifiers Michail G. Lagoudakis Ronald Parr Department of Computer Science, Duke University, Durham, NC [removed] USA Source URL: www.cs.duke.edu Language: English - Date: 2003-12-20 17:25:46 Dynamic programming Markov processes Stochastic control Reinforcement learning Support vector machine Markov decision process Supervised learning Temporal difference learning Policy Statistics Machine learning Statistical classification
228	Modular Value Iteration Through Regional Decomposition Linus Gisslen, Mark Ring, Matthew Luciw, and Jürgen Schmidhuber IDSIA Manno-Lugano, 6928, Switzerland Source URL: agi-conference.org Language: English - Date: 2012-12-09 10:12:52 Markov models Stochastic control Markov decision process Reinforcement learning Bellman equation Markov chain Partially observable Markov decision process Statistics Dynamic programming Markov processes
229	Journal of Artificial Intelligence Research [removed] 468 Submitted 1/02; published [removed] Efficient Solution Algorithms for Factored MDPs Carlos Guestrin Source URL: www.cs.cmu.edu Language: English - Date: 2003-10-30 16:25:03 Applied mathematics Mathematical optimization Dynamic programming Markov processes Stochastic control Reinforcement learning Markov decision process Function Algorithm Mathematics Statistics Operations research
230	PROST: Probabilistic Planning Based on UCT Thomas Keller and Patrick Eyerich Albert-Ludwigs-Universität Freiburg Institut für Informatik Georges-Köhler-Allee [removed] Freiburg, Germany Source URL: gki.informatik.uni-freiburg.de Language: English - Date: 2012-05-24 10:56:58 Stochastic control S0 Markov decision process Planning Domain Definition Language Reinforcement learning Automated planning and scheduling Bellman equation Temporal difference learning Tree Statistics Dynamic programming Markov processes
